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Abstract 


This empirical study investigates the effect of online textual, pictorial, and textual pictorial 
glosses on the incidental vocabulary! learning of 90 adult elementary Iranian EFL learners. 
The participants were selected from a pool of 140 volunteers based on their performance on 
an English placement test as well as a knowledge test of the target words in the study. 
Afterward, they were randomly assigned to 3 groups of 30 and subsequently exposed to the 
research treatment. During 3 sessions of instruction, 5 computerized reading texts including 
25 target words were studied. The participants read the texts for comprehension and, at the 
same time, were able to consult the glosses attached to the target words. Having read each 
text under each research condition, the participants were tested on their incidental 
vocabulary learning through two research instruments, word and picture recognition tests. 
The results of a one-way ANOVA analysis of the data indicate that a combination of text and 
still images resulted in significantly better incidental vocabulary learning, confirming the 
Dual-coding Theory (Paivio, 1971, 1990). 

Introduction 


Research suggests that a large portion of the vocabulary children learn in their LI is 
incidental in nature, a by-product of reading (Huckin & Coady, 1999) or listening (Nagy, 
Anderson & Herman, 1987) which provides at least three benefits for language learners: 

1 . a richer grasp of the contextual meaning and use 

2. the concurrency of the two activities (e.g., reading/listening and vocabulary learning) 

3. a more learner-centered learning process. 
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Likewise, it is generally accepted that a considerable percentage of learners’ L2 vocabulary is 
acquired incidentally. Huckin and Coady (1999) highlight the importance of incidental 
vocabulary learning by referring to several studies indicating that learners gain more 
vocabulary knowledge through extensive reading with guessing at the meaning of unknown 
words. 

However, despite the obvious advantages, there are also a number of disadvantages for 
incidental vocabulary learning. For example, research suggests that contextual information is 
often unclear for language learners to make correct inferences (Bensoussan & Laufer, 1984; 
Mondria & Wit-de Boer, 1991), leading to learners’ making wrong inferences and, thus, 
running the risk of learning words incorrectly. Interestingly; however, one of the ways such a 
disadvantage might be alleviated is by using marginal glosses (Hulstijn, Hollander & 
Greidanus, 1996), which has proved quite effective in printed materials. 

Despite the mixed views (James, 1996) towards the potentials of Computer Assisted 
Language Learning (CALL), one can consider the element of time being highly influential in 
judging technology related issues. According to Jones (2000), the availability of many 
current electronic resources provides numerous opportunities for making texts more 
comprehensible to learners. Indeed, one of the recent developments in making texts more 
comprehensible to readers is using computerized glosses/annotations. 

Glosses 


According to Lomicka (1998), the concept of glossing “dates back to the Middle Ages when 
students struggling with a foreign text, usually Latin, produced them as they moved along 
during the reading process” (p. 41). They are “typically located in the side or bottom margins 
of a page, [and] are most often supplied for unfamiliar words, which may help to limit 
continual dictionary consultation that may hinder and interrupt the L2 reading 
comprehension process” (p. 42). However, this learner-oriented technique was soon adopted 
by teachers and pedagogues so that they could present a short definition or note for unknown 
words to facilitate the reading comprehension process for L2 learners. The issue of glossing 
is by no means a medieval phenomenon now. Leloup and Ponterio (2000) refer to the current 
status of glossing as: “[T]he cues that appear when the reader clicks on the glossed 
vocabulary take various forms. Some are text explanations only, generally using a 
combination of target language and English words. Others are pictorial representations of the 
meaning of the word or phrase” (p. 7). 

Roby’s (1998) taxonomy of the present types and significance of glosses in teaching, serves 
to almost comprehensively depict the different layers of such contemporary teaching aids. In 
fact, nowadays researchers consider the usefulness of glosses as a point of departure, and it is 
investigating the different types that constitute much of the current research (Yoshii, 2006). 

Therefore, the following presents a brief review of the related literature on the effectiveness 
of non-CALL and CALL glosses as used in vocabulary acquisition research. 
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Non-CAT J> Glosses 


Concerning the possible usefulness of glosses in assuaging the disadvantages of incidental 
vocabulary learning, several studies have investigated glossed printed materials. Working 
with American students studying Spanish as an L2, Jacobs, Dufon, and Fong (1994) found 
that the performance of the gloss condition was significantly better on a vocabulary test 
administered immediately after the treatment. This study also compared the effectiveness of 
LI and L2 glosses which found no significant difference between the two. 

Further supporting evidence comes from Hulstijn, et al. (1996) who conducted their research 
with Dutch students learning French as an L2. This study showed that having access to LI 
marginal glosses was more effective than using bilingual dictionaries or, similarly, having no 
access to dictionaries or marginal glosses. 

Watanabe (1997) investigated how text modification and task would affect incidental 
vocabulary learning. This study, which was carried out with Japanese university students, 
indicated that the use of L2 glosses in the texts helped the participants retain more 
vocabulary compared to when they worked with texts containing no modifications, or 
appositives. This study also established no significant difference in the effectiveness of LI 
and L2 glosses. Furthermore, the research compared single- and multiple-choice glosses. The 
participants were required to choose the correct definition from the two alternatives offered, 
which revealed no significant difference in the effectiveness of the two types. However, this 
finding might be slightly different from what Nagata (1999) revealed based on a Japanese 
courseware program called Banzai Readings (please see the section on CALL glosses). 

Working with American students learning German as an L2 in a second semester course, 
Kost, Foss, and Lenzini (1999) compared three types of LI glosses, Text-only, Picture-only, 
and Text-and-Picture. The results indicated that the Text-and-Picture (combination) condition 
was the most effective of the three types. A similar study by Yoshii and Flaitz (2002), by 
comparison, examined the learners’ incidental vocabulary learning through incorporating the 
task into an online computerized environment (please see below). 

CALL Glosses 


Chun and Plass (1996) investigated the effect of multimedia annotations on the incidental 
vocabulary learning of 160 university German students. Using their computerized reading 
program, they conducted three studies, employing a within subjects design. The students 
used the same version of the program and worked with the program in a realistic L2 learning 
situation. Afterward, the participants were tested on their incidental vocabulary learning and 
overall reading comprehension while being free to choose the available annotations. The 
results indicated that “the recall protocol for visual annotations (that is, words annotated with 
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text and pictures, text and video) was higher than for words annotated with text alone” (p. 
189). 

Lyman-Hager and Davis (1996) carried out an experiment using their interactive reading 
program, employing two conditions: computerized reading, and non-computerized reading. 
The first group had access to multimedia annotations while the other group consulted printed 
text with the same glosses. After the experiment, a written recall protocol as well as a 
vocabulary quiz of the target words provided the researchers with the conclusion that 
students who worked with the multimedia program were better able to retain vocabulary 
words than students who worked with non-computerized text. 

Nagata (1999) examined the single- or multiple-choice glosses as used in a Japanese 
courseware program. The single-gloss version of the program provided an English translation 
for each target word or grammatical structure in the reading text, and the multiple-choice 
version provided two alternative translations in a multiple-choice format followed by 
immediate feedback on the participants’ choice. The results revealed that the multiple-choice 
condition significantly outperformed the single-gloss condition, since it helped learners with 
deeper lexical processing as well as feedback on the errors. The findings of this study can 
well be compared with those provided by Watanabe (1997, please see the previous section). 

Al-Seghayer (2001) examined the effect of dynamic video or still pictures on vocabulary 
learning. Thirty participants studying at an American university participated in the study. The 
students were exposed to one of three conditions: textual gloss alone, textual gloss and still 
pictures, and textual gloss and dynamic video. The participants were subsequently evaluated 
on their vocabulary gains through recognition and production tests. The results indicated that 
when learners looked up a combination of video clips and text definitions, they learned 
u nkn own vocabulary items better than when they looked up definitions alone or in 
combination with still images. 

Investigating the effect of visible/invisible li nk s on L2 reading, De Ridder (2002) conducted 
a study with advanced learners of French as a second language. The research was carried out 
under two conditions, the visible, and invisible links. In the former, the students read the text 
with access to highlights on glossed words and, in the latter, the learners were presented with 
the same text but with no highlights on the glossed words. The results indicated that the 
participants’ clicking behavior resulted in higher vocabulary gains and, incidentally, did not 
impair reading comprehension. Furthermore, the two groups were not significantly different 
in comprehension level, but merely different in their vocabulary gains. In addition, the results 
of a delayed vocabulary test showed that no significant differences existed in the 
performance of the two groups. 

Yoshii and Flaitz (2002) studied the effect of annotation type on learners’ incidental 
vocabulary learning. There were 151 adult ESF learners at beginning and intermediate 
language proficiency levels in their study who read a short story under one of three 
conditions: text-only, picture only, or text-and-picture (combination). In these three 
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treatments the glosses were attached to the verbs in the reading text. Having read the text, 
the participants were tested on their vocabulary gains by taking immediate, and delayed word 
and picture recognition, as well as definition-supply tests. The results indicated that the 
combination group outperformed the other two groups on all measures both in immediate and 
delayed tests, even though the differences were smaller in delayed tests. This study was a 
replication of the study by Kost et al. (1991), which was reported in the section on non- 
CALL studies. 

A study carried out by Yeh and Wang (2003) investigated the effect of three types of 
multimedia glosses, text-only, text and picture, and text, picture and sound, on the incidental 
vocabulary learning of 82 university students in Taiwan. In addition, the researchers used 
both LI (Chinese translation) and L2 (English explanation) in textual glosses. The results 
indicated that the combination of text and picture was the most effective type of annotation. 

Yoshii (2006) compared the effectiveness of LI and L2 glosses on the incidental vocabulary 
learning of 195 Japanese university students. There were four groups in the study, LI -text- 
only, L2-text-only, LI -text-plus-picture, and L2-text-plus-picture. The research instruments 
were immediate and delayed definition-supply and word recognition tests. However, the 
results indicated that there were no significant differences between the two language gloss 
types. Significant differences were found between picture (text-plus-picture) and no-picture 
(text-only) glosses for definition-supply test. Delayed tests, on the other hand, showed that 
the L 1 text-only group outperformed the L2 text-only and L2 text plus picture groups in 
recalling the target words. 

A more recent study in the field has been carried out by Yanguas (2009) following the 
theoretical framework of attention (Robinson, 1995). Applying four treatments, namely 
textual, pictorial, textual plus pictorial, and a control condition for comparison, with 94 
students of fourth semester college-level Spanish, he used think-aloud technique, reading 
comprehension, recognition, and production measures to investigate the effects of different 
types of multimedia glosses when the goal was comprehension of a computerized text. The 
results indicated that first of all, all the multimedia groups outperformed the control group on 
noticing and recognition measures. Secondly, there was no significant difference in the 
performance of the groups on the production measures, finally, the combination group 
outperformed all other groups on the comprehension measures. The results of this study 
suggest that a combination condition is ideal for text comprehension. 

Overall, the studies reported here assigned a positive role to CALL in improving the quality 
of (incidental) vocabulary learning. Consequently, this study, in line with the theoretical 
framework of Dual Coding Theory (Paivio, 1971, 1990), attempts to shed light on the 
effectiveness of textual, pictorial, and textual pictorial glosses in the incidental vocabulary 
learning of adult Iranian ELL learners at the elementary level. The present study, hence, 
attempts to address the question: “Is there any significant difference in the incidental 
vocabulary learning of the participants when exposed to three different modes of multimedia 
annotations in the course of reading?” 
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Method 


Participants 

The participants were 90 (n=90) male Iranian EFL learners enrolled in an English as a 
Foreign Language (EFL) course in Iran, who were selected from an initial pool of 140 
volunteers. They were invited to participate in the research by an announcement at a private 
institute, recruiting Interchange intro students. Although the participants were purportedly 
homogeneous in terms of their perceived level at the Interchange course, they were given an 
additional standardized English placement test. They ranged in age from 16 to 22 and had 
scores ranging from 101 to 109, which indicated that they were elementary-limited users, 
based on the OPT Language Level specification, or A2 Waystage in keeping with the 
Common European Framework. Moreover, the participants were assessed based on their 
knowledge of the target words in the study. Therefore, besides being homogeneous in terms 
of the level of English language proficiency, total lack of familiarity with the final pool of 25 
target words constituted the second criterion for participant selection. 

Materials 


The reading passages used in this study were selected from the book Communicative 
Reading Skills (CRS) based on the materials prepared by Root and Blanchard (2004) and 
edited by Mirhassani and Alavi (2004). The texts were checked against Flesch readability 
formula to guarantee the readability level of texts (please refer to Appendix B and C). The 
texts used in this study, therefore, enjoyed the readability levels, “fairly easy” and “standard” 
based on the readability of texts which had been studied by the participants in the interchange 
course. The target words were the focus of instruction in the passages selected. 

In order to gloss the target words in the three modes of instruction, it was required that clear 
definitions as well as pictures be provided. The textual definitions were extracted from 
Oxford Learner s Dictionary (1991) and the pictorial definitions were extracted from the 
Internet. Not only was great care exercised to find clear and contextually-appropriate textual 
and pictorial definitions, but further these selections were evaluated by two raters. The 
following table demonstrates the guessability of pictures as determined by the raters: 
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Table 1. Inter-rater Reliability of Pictorial Cues 


Pictorial 

Cues 

Correlation 

Coefficient 

Shared 

Variance 

Pictorial 

Cues 

Correlation 

Coefficient 

Shared 

Variance 

Convertible 

0.99 

0.9801 

Teammate 

0.85 

0.7225 

Pony 

0.99 

0.9801 

Referee 

0.99 

0.9801 

Cone 

0.98 

0.9604 

Coach 

0.92 

0.8464 

Snack 

0.95 

0.9025 

Athlete 

0.91 

0.8281 

Cookies 

0.99 

0.9801 

Bike 

1.00 

1 .0000 

Pet 

0.97 

0.9409 

Competition 

0.87 

0.7569 

Tornado 

0.94 

0.8836 

Ocean 

0.97 

0.9409 

Storm 

0.97 

0.9409 

Brain 

0.99 

0.9801 

Funnel 

1.00 

1.0000 

Octopus 

0.99 

0.9801 

Basement 

0.92 

0.8464 

Crab 

0.99 

0.9801 

Floor 

0.98 

0.9604 

Jar 

1.00 

1 .0000 

Ravine 

0.89 

0.7921 

Mammal 

0.90 

0.8100 

Disc 

0.94 

0.8836 





Reading each text, the participants had the option to consult definitions of the target words by 
placing the mouse pointer over the bold-faced words. All pages had common design features 
(please refer to Appendix B). 

Instruments 


English Language Placement Test 

In order to guarantee the close homogeneity of the groups, the Oxford Placement Test was 
administered to the participants. The test, which is a commercially developed package, is 
claimed to grade and place students reliably into appropriate levels. It is divided into two 
sections, listening and grammar, which take about an hour to complete. The results are 
interpreted by referring to the test manual. By reference to a 1 2-column table of level 
specifications, students can be assigned to levels within the OPT Band, OPT Score, OPT 
Language Level, Common European Framework Level, ALTE & QPT, UK NQF level, 
IELTS, Cambridge ESOL Main Suite, Cambridge BEC, Cambridge CELS, TOEFL, and 
TOEIC. 

Word Recognition Test 

The tests evaluated the participants on the learning of the target words by presenting them 
with a written definition. The students had to choose a suitable definition for each target 
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word from the same number of alternatives plus two distracters. The definitions were phrased 
differently from those used in reading passages, although they conveyed the same meaning. 
Likewise, the picture recognition test required the participants to choose a related picture for 
each target word. The pictures were also different from the ones used in the study even 
though they conveyed the same meaning. Such a safeguard was taken to avoid the 
participants’ memorizing the definitions as well as pictures encountered in the course of 
reading (please refer to Appendix A). 

The use of the picture recognition test was primarily based on the studies by Kost et al. 

(1999) and Yoshii and Flaitz (2002). Since this study replicates these studies in some 
respects, I felt it necessary to include the test as accurately as possible. It might be interesting 
to see how pictures can convey the meaning of words, or how pictures can help tap 
participant’s understanding of the meaning of the words, but one wonders how valid this type 
of test is, since learners might not normally take such a test in real life situations. 

Procedure 


One week before the study, a standardized English placement test was administered to the 
volunteers. Once the researcher made certain that the participants formed a homogenous 
sample, a pretest examining the knowledge of the target words was administered. The 
participants were presented with a list of 32 words and were instructed to put a check mark 
by each word they knew and write down a short definition or synonym in English or Farsi for 
the checked words. Subsequently, the words which were defined correctly by the participants 
were discarded from the initial pool of target words, resulting in the elimination of seven 
words. When the final participants as well as the target words were identified, the 
participants were randomly divided into three groups of 30 and the texts encompassing the 
target words were glossed and made online. Three versions of the same texts were designed, 
each displaying one type of gloss, textual, pictorial or textual-pictorial (combination), as 
definitions for the target words. During three sessions of instruction, 100 minutes each, the 
texts were worked on a computer site. For each group, the first session was allocated to the 
demonstration of the learning medium. The participants were introduced to different parts 
and components of the website including the entry for reading texts, glosses, test pages, and 
operating the website. 

In the second session, the reading of materials under each condition followed. To each 
reading passage along with its accompanying test 30 minutes was devoted, 15 minutes to 
each activity, through a countdown function on the website. The redirection behavior 
assigned to each page did not interrupt the participants throughout the two activities; 
therefore, the participants managed to finish each activity within the allocated time. The 
glosses could be consulted by placing the mouse pointer over the colored boldface words. 

The following snapshot shows the first reading text in the combination mode of research as 
the word “pony” has been consulted: 
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Two Funny Stories about Fritz 


T 

o understand one of my favorite stories, you have to know that my 



My mother was always askng rre if I had b 
every meal she would say, ’Did you brush your 
a snack, just some cookies or a piece of fruit 
had brushed my teeth. 1 decided that Fritz sho. 
Naturally, I thought I should clean them for hirr 
toothpaste and brushed his teeth Fritz just trv 
lips back so I could brush real wr II. Then, I wo 
on rrry brush and begn again. Fritz would roll c 
mouth. He was so happy when I brushed he tie 


We had many different pets when I was 
definitely the most interesting. 


family owned a very tig car. It was a convertible. When the top was 


down, Fritz, rrry pony, could fit Into the back 01 
ice cream, we aKvJKv took him with us when ■*> 
You can imagine tt\ oaks we got from the pec 
down the street with a pony on the back seat e 


ra- Uoca r r itr Igufld 


Table of Contents 
i eet 


Figure 1. Consulting the Word “pony” in the Combination Mode of Research 

When the reading task finished, the participants were redirected to the test page where they 
were presented with the two main testing instruments, word and picture recognition tests, 
and, as a safety measure, two reading comprehension items to avoid the participants’ 
guessing the main concern of the research. Although the test items were displayed on the 
screen, the participants were to answer the questions on the answer sheets which was 
distributed towards the end of the reading task. The students were not allowed to look at the 
text while they worked on the vocabulary tests. 



Two Funny Stories about Fritz l est 


f trmA nj O mm (IkirtAimi 

| wirin’ 

7 VhrddtMifMlaitrutfiRn'iMI 


1 . canw- tt* 

2 pery IP) * ba w » * it d» <rtJ « 

1. con* [c] • hart* trmM tmfr 

i inti U)*cjr lc»tyli»bt n»»*l 

5 co»0* I#)b**c>r 

4 pn ff) *v l«pi tr mwpamr^fi or *rm»wnl 

lg) ta» M l 4 oar of * 
p>) 4 Mp f**r'Ow -tiwy 


dn Kgp to* pot* 



Figure 2. The Test Page 
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Once the answer sheets were collected, the participants proceeded to the next reading text. 
Accordingly, by the end of the second session, the participants had studied three texts. The 
third session followed the same procedure and the two remaining texts were studied. 

Results 

The data were analyzed using the one-way ANOVA statistical analysis as performed in the 
environment of the software SPSS 15.0 for Windows. For all the analyses, the alpha level 
was set at .05. 

English Language Placement Test Results 

Even though the 140 volunteers had been chosen from a population of Interchange intro 
students, they were further given a standardized English placement test. As was stated earlier, 
the 90 students scoring within the range 105-119 were chosen as the final pool of participants 
in the study. 

Posttest Results 


After finishing each reading text, the three groups were tested on the immediate recall of the 
target words via the two instruments, word and picture recognition tests. 

Word Recognition Test 

The word recognition tests were evaluated based on the number of correct responses. To each 
correct choice one point was assigned. Comparing the mean scores revealed a contrast in the 
performance of the three groups, with the combination group (M = 24.17) outperforming 
both the pictorial group (M = 20.37) and the textual group (M = 17.07). 

In order to further investigate whether the differences among the means were statistically 
significant, a one-way ANOVA analysis was performed on the data. The results indicated that 
significant differences existed in the performance of the groups, F(2, 87) = 91.77, p < .05. 

The results of a post hoc Scheffe test indicated that group means significantly differed for the 
three conditions in the study, that is, the combination group outperformed the other two 
groups on the word recognition test (M =24. 1 7, SD = 1 . 1 1), and the pictorial group (M = 
20.37, SD = 2.37) outperformed the textual group (M = 17.07, SD = 2.34). 

Picture Recognition Test 

The same scoring procedure and statistical analyses were employed in evaluating the picture 
recognition test. By comparison, the performance of the groups on this test revealed 
something of an actual difference. The trends in the analysis indicated that significant 
differences existed in the performance of the groups, F(2, 87) = 335.99, p < .5. Likewise, the 
combination group outperformed the other two groups on the picture recognition test (M = 
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24.73, SD = .52), and the pictorial group (M = 23.53, SD = 2.06) outperformed the textual 
group (M = 13.83, SD = 2.24) on this measure. 

Posttest Results bv Research Question 

Is there any significant difference in the incidental vocabulary learning of the participants 
when exposed to three different modes of multimedia annotations in the course of reading? 

As was demonstrated earlier, the pictorial group outperformed the textual group on both 
measures. Significant differences were also found in the performance of the two groups on 
word recognition as well as picture recognition tests. Moreover, the textual-pictorial group 
outperformed the textual group on both measures and significant differences were also found 
in the performance of the two groups. Furthermore, it was found that the textual-pictorial 
group still outperformed the pictorial group on the two measures, although the mean 
differences were comparatively less than those of the textual group, with the picture 
recognition test marking the least difference. 

Moreover, as a measure of the dispersion of a statistical population, the standard deviation of 
scores in the three groups was indicative of the effectiveness of the combination gloss type as 
well. 


Table 2. The Standard Deviation of Scores Obtained by the Three Groups 


Groups 

Word 

Recognition 

Test 

Picture 

Recognition 

Test 

Textual 

2.34 

2.24 

Pictorial 

2.37 

2.06 

Textual Pictorial (Combination) 

1.11 

0.52 


Table 2 shows the cross-tabulation of the standard deviations of the scores obtained by the 
three groups on the word and picture recognition tests. As can be seen, the standard deviation 
of scores obtained by the combination group was, by comparison, smaller on both measures, 
indicating that scores tended to be closer to the mean. Also, the variability of scores was 
smaller on the picture recognition test. 

Discussion 

The findings of this study confirmed the previous findings (Al-Seghayer, 2001; Chun & 
Plass, 1996; Yeh & Wang, 2003; Yoshii & Flaitz , 2002). The results suggested that a 
combination of textual and pictorial glosses was more beneficial to the learners, possibly due 
to with the fact that they received two modes of input (Ellis, 1994), namely verbal and visual. 
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The results of this study are similar to those of Yoshii and Flaitz (2002) in that the 
combination group outperformed the other two groups. In their study; though, the differences 
between the pictorial group and the textual group were not so significant except as regards 
the picture recognition test in which the pictorial group had an advantage over the text-only 
group. This could be due to the fact that Yoshii and Flaitz (2002) were basically examining 
incidental vocabulary learning via pictorial glosses as “simple line drawings designed to be 
as culturally and linguistically neutral as possible for foreign language instruction” (p. 38) 
that were attached to 14 verbs. Interestingly, the two variables in these studies, that is 14 
verbs vs. 25 concrete nouns and simple line-drawings vs. high-quality- images, could have 
potentially resulted in the relatively larger difference found in the performance of the textual 
and pictorial groups in this study. Research suggests that learning concrete nouns is easier 
than learning abstract nouns (Kess, 1992; Whitney, 1998) and that learning nouns is easier 
than learning verbs (Ellis, 1994). Besides, the quality of pictures, which could be simply the 
use of color pictures or the use of real-life shots, must have been more effective in triggering 
the memory and resulting in sounder incidental vocabulary learning. Therefore, the results 
seem reasonable. 

An interesting finding regarding the performance of the pictorial group was that this group 
performed virtually the same (M=23.53) as the combination group (M=24.73) on the picture 
recognition test; while, the textual group had a significantly lower mean (M=18.83). This 
finding seems logical in that the pictorial group was exposed to pictorial glosses, even though 
the pictures in the test were different from those the students observed in the glosses attached 
to the target words. Though it was expected that the textual group would, in turn, outperform 
the pictorial group on the word recognition test, the reverse turned out to be the case and the 
pictorial group still outperformed the textual group. It is worth mentioning that incidental 
vocabulary learning is, by comparison, more effective with the use of pictures. 

Regarding the variability of scores, it was determined that the combination group had an 
advantage over the other two groups on both word and picture recognition tests. This is 
further evidence to support the idea that pictures help foster incidental vocabulary learning. 

On the whole, the two instruments indicated that the combination of the two glossing 
techniques, namely textual and pictorial, was most influential in helping the participants with 
learning incidental vocabulary. 

Implications and Applications 

The rationale for using glosses as reading aids is that they free up learners’ working memory 
by providing the bottom-up function of processing unknown words (Chun, 2006). 
Furthermore, the provision of such learning aids will make unnecessary continual dictionary 
searches and the resultant interruption in the course of learner reading. 

Some studies (see Knight, 1994; Krashen, 1993) corroborate the idea that reading a text with 
the purpose of comprehension will help learners retain vocabulary incidentally. The study 
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reported here confirmed the previous findings. Furthermore, it also revealed that learners will 
indeed learn significantly better when they are provided with more input presentation modes. 
This is in line with the Dual-coding Theory (Paivio, 1971, 1990), which states that 
information coded both verbally and visually is more effective for learning than information 
coded in either form. 

The use of marginal glosses has been deemed influential in removing the potential risks of 
learning words incidentally (Hulstijn et al., 1996) as shown in printed materials. The present 
study indicated that online glosses can, indeed, alleviate the problems linked with incidental 
vocabulary learning in online materials as well. This study has some implications for both 
syllabus designers and decision makers. Since one of the characteristics of vocabulary 
learning is the sheer size of the task, any means that can lighten this burden for students 
should be appreciated. Not only are intentional means of learning vocabulary needed, but 
also incidental ones should be depended upon (Nation, 1999). 

Utilizing computers, multimedia and IT has proven to be influential in language teaching in 
general, and incidental vocabulary learning in particular. Therefore, equipped with sound 
theoretical knowledge, material designers may create appropriate CALL programs which can 
promote learning, and subsequently those programs can be used in language classrooms. Of 
course, such programs ought to be designed based on sound theoretical and pedagogical 
principles (see Lee, Owens, & Benson, 2002; Plass, 1998). 

Language teachers might find the results of this study useful in that it provides further 
evidence for the importance of multimodality of input presentation. Since the added 
significance of glosses as teaching and learning aids in incidental vocabulary learning was 
reconfirmed in this study, teachers might rely upon CALL to unify the two and, thus, enhance 
the learning experience for language learners. In case language teachers might lack the 
training or time to write CALL programs, there are some reliable applications available that 
can be used to gloss reading texts. 

Suggestions for Further Research 

This study did not distinguish between the learning styles of participants. The rich literature 
on Individual learner Differences (IDs) suggests, “there is a particularly wide variation 
among language learners in terms of their ultimate success in mastering an L2” (Dornyei, 
2005, p. 6). Therefore, there is a need to carry out the same study taking into account the 
participants’ learning styles. Some learners might be visualizers, getting more advantage 
from pictures; while others, verbalizers, might benefit more from textual materials (Ellis, 
1994). 

Research suggests that learners with large vocabularies gain benefits more from marginal 
glosses (Jacobs et al., 1994). As a result, it could be valuable for another study to examine the 
incidental vocabulary learning of more proficient language learners through the same 
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procedure. Though one of the unique characteristics of this study was the fact that it was 
carried out with low-level Iranian English language learners. 

This study investigated the immediate incidental vocabulary gains of participants. There is a 
need to further assess the delayed retention of target words after a one/two-week period. 

This study examined the incidental learning of twenty-five concrete nouns. Aside from the 
fact that twenty-five is too small a sample to provide us with airtight proof, a similar study is 
needed to investigate abstract nouns. Multimedia software can potentially provide more 
versatile tools for portraying qualities which cannot be drawn in printed materials. 

This study also controlled for gender. A similar study could investigate the effect of the three 
annotation types on the incidental vocabulary learning of female students. 

There were a large number of comments, feedbacks, and gestures from students that resulted 
from throughout the events encountered during the experiment. Elowever, the task of running 
this quantitative study did not allow the researcher to appreciate such invaluable pieces of 
qualitative data. It is suggested that another study focus on qualitative aspects of teaching and 
learning with multimedia CALL programs in general, and multimedia glosses in particular. 

Conclusion 

This study investigated the effectiveness of three multimedia annotation types, namely 
textual, pictorial, and textual pictorial, on the incidental vocabulary learning of 90 adult 
Iranian ELL learners at the elementary level. Like the previous research carried out in the 
field (Al-Seghayer, 2001; Chun & Plass, 1996; Yeh & Wang, 2003; Yoshii & Llaitz, 2002), 
the results indicated that a combination of text and still images resulted in significantly better 
incidental vocabulary learning. This study confirms that “electronic dictionaries and software 
that provide textual, contextual, and/or multimedia annotations” are part of “main 
technologies” which support specific components of reading, especially incidental 
vocabulary learning (Chun, 2006, p.l), and that multimodality (Guichon and McLoman, 
2008) in CALL strongly enhances incidental vocabulary learning. 
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Appendix A. Instruments 
A. Word Recognition Test 

1. Two funny stories about Fritz 


1 . convertible 

[a] small quick meal 

2. pony 

[b] something that narrows to a point from a circular flat base 

3. cone 

[c] a horse with small body 

4. snack 

[d] a car that has top that can be folded or removed 

5 . cookies 

[e] biscuit 

6. pet 

[f] tame animal 


[g] lowest floor of a building 


[h] deep narrow steep-sided valley 


2. Tornado 


1 . tornado [a] violent destructive storm with circular winds 


2. storm 


[b] period of very strong winds, rain, etc. 



3. funnel 1 

4. basement 

5 . floor 

6. ravine 


[c] tube that is wide at the top and narrow at the bottom 

[d] lowest floor of a building, below ground level 

[e] deep narrow steep-sided valley 

[f] surface of a room on which one stands and walks 

[g] sea animal with a soft body and eight arms 

[h] container, usually made of glass, with a wide top 


3. Ultimate: An Exciting Sport 


1 . disk 

[a] 

2. teammate 

[b] 

3. referee 

[c] 

4. coach 

[d] 


[e] 


m 

4. Running to 

Win 

1 . athlete 

[a] 

2. bike 

[b] 

3. competition 

[c] 

4. ocean 

[d] 


organ in the body that controls thought, feeling, etc. 

person who buys something in a shop 

person who trains an athlete in a sport 

person who does a teamwork 

person who controls a game 

the thin flat circular plate that is easy to throw 


shelter made of canvas, etc. and supported by poles and ropes 
exciting or dangerous journey or activity 
lowest floor of a building, below ground level 
person trained for physical games 


[e] event in which people compete 



[f] (infml) short for BYCYCLE 


5. Can Animals Think 


1 . brain 

2. octopus 

3. crab 

4. jar 

5 . mammal 


[a] deep narrow steep-sided valley 

[b] organ in the body that controls thought, feeling, etc. 

[c] sea animal with a soft body and eight arms 

[d] ten-legged shellfish 

[e] container, usually made of glass, with a wide top 

[f] any of the kind of animal of which the female feeds her young 
with milk from her body 

[g] mass of very high rock with steep sides 


B. Picture Recognition Test 

1. Two Funny Stories about Fritz 


A B C I) E F G 



2. Tornado 


G 




H 





3. Ultimate: An Exciting Sport 



Appendix B. Treatment Materials 
1. Two Funny Stories About Fritz 

To understand one of my favorite stories, you have to know that my 
family owned a very big car. It was a convertible. When the top was 
down, Fritz, my pony, could fit into the back of the car. Since Fritz loved 
ice cream, we always took him with us when we went out for ice cream. 
You can imagine the looks we got from back seat eating an ice cream 







cone! My mother was always asking me if I had brushed my teeth. Every 
time I had a snack, just some cookies or a piece of fruit, she wanted to 
know if I had brushed my teeth. I decided that Frit should have clean teeth 
too. Naturally, I thought I should clean them for him. So, I got out my 
toothpaste and brushed his teeth. Fritz just loved it. We had many 
different pets when I was growing up, but Fritz was definitely the most 
interesting. 


Flesch Readability Statistics: 
Passive Sentences: 0% 

Fleck Reading Ease: 79.9 
Readability: Fairly Easy 


2. Tornado 

With the arrival of the tornado season, the National Weather Service is 
again telling people how to protect themselves from these deadly storms. 
The winds from tornados are the most violent winds on earth. They can 
blow up to 400 miles per hour. A tornado looks like a funnel; it is also 
very loud. It may sound like a train coming at you. In fact, the winds from 
a tornado can pick up a section of a train and throw it around, if a tornado 
is seen in your area, it is very important that you protect yourself. A 



basement is the safest place to go. Try to wait under a table in the 



basement. If your building does not have a basement, stay on the ground 
floor but lie flat under a bed or table. Stay away from windows. If you are 
outside or in your car, try to find a ravine to lie down in. 


Flesch Readability Statistics: 
Passive Sentences: 8% 

Fleck Reading Ease: 75.0 
Readability: Fairly Easy 


3. Ultimate: An Exciting Sport 


Ultimate is becoming a very popular sport. It is played in more than forty- 
two countries all over the world. It is played with a disc that looks like a 
Frisbee. But the Ultimate disc is larger and heavier than a Frisbee. The 
purpose of the game is to score goals. A goal is scored when a teammate 
catches the disc in the end zone. The first team to score fifteen goals wins 



the game. The basic rules of Ultimate are easy to learn. Each team has seven 
players on the field. Players cannot run while they are holding the disc. 
They must throw the disk to another player on their team to move it down 
the field. They must not let the disc touch the ground. It is a foul if a player 
touches another layer to prevent him or her from catching the disc. A foul is 
an action that is against the rules. There are no referees in Ultimate. The 



players on the filed make the decisions about what is fair. There are usually 
no coaches in Ultimate. The combination of trust and competition makes 
ultimate the ultimate sport to many people. 

Flesch Readability Statistics: 

Passive Sentences: 16% 

Fleck Reading Ease: 76.7 
Readability: Fairly Easy 


4. Running to Win 

Martha Sorensen loves to run. In high school and college, she won many 
races. Now, at age thirty-three, running is a big part of her job. So are 
swimming and bicycling. Martha is a professional athlete. She competes in 
triathlons. A triathlon has three parts. First, there is a long swim. Then the 
athletes come out of the water and ride their bikes. When they finish riding 

their bikes, they have to run for miles! It is a very difficult sports 

competition. Three years ago, Martha went to Hawaii. She joined athletes 
from around the world. They competed in the famous Ironman Triathlon 
there. First, the athletes swam almost a mile in the ocean. Then they rode 
112 miles (180.2 kilometers) by bike. Finally, they ran a 26.2-mile (42.16 
kilometer) race. Martha did very well. She finished that race in about ten 
hours. Only six women were faster than Martha. 



Flesch Readability Statistics: 

Passive Sentences: 0% 

Fleck Reading Ease: 61 .0 
Readability: Standard 

5. Can Animals Think? 

There are many stories about many different things animals can do, but are 
they true? Can animals really think? Now, many of scientists believe that 
some animals have the brain power to understand new situations, make 
decisions, and plan ahead. In Italy, scientists showed that an octopus could 
learn how to perform a task by watching another octopus do it. In this 
experiment, an octopus who did not know how to open ajar to get to a crab 
inside was allowed to watch another octopus who did know how. After 
observing how the second octopus did it, the first octopus was able to open 
the jar himself. Until recently, many scientists had thought that only 
mammals could learn by watching others. But, as some scientists say, we 
need to conduct a lot of experiments to give a firm answer to this question. 

Flesch Readability Statistics: 

Passive Sentences: 12% 

Fleck Reading Ease: 60.3 


Readability: Standard 


Appendix C. Readability of Two Interchange Intro Texts 

1. Two Special Houses in the American Southwest (P. 49) 

In San Antonio, Texas, there is a purple house. This house is the home of Sandra 
Cisneros. Ms. Cisneros is a Mexican- American writer. She is famous for her interesting 
stories. The house has a porch with a pink floor. The rooms are green, pink, and purple. 
There are many books and colorful paintings. Many other houses near Ms. Cisneros’s 
house are white or beige, so her house is different. Some of her neighbors think her house 
is too colorful, but Ms. Cisneros loves it. 

Passive Sentences: 0% 

Flesch Reading Ease: 64.4 
Flesch-Kincaid Grade Level: 6.5 
Readability Level: Standard 

2. Job Profiles (P. 55) 

Lisa Parker has two jobs. She works as a waitress at night, but she’s really an actress. 
During the day, she auditions for plays and television shows. Her schedule is difficult, 
and she’s tried a lot. But she’s following her dream. Lots of teenagers want John Blue’s 
job. He plays video games for eight hours a day. And he gets paid for it! John is a video 
game tester for a big video game company. Is it ever boring? Never. John almost always 
wins. Becky walks in the park every day for many hours - rain or shine. Becky is a 
professional dog walker. She walks dogs for other people. Sometimes she takes 20 dogs 
to the park at one time. Carlos Ruiz is a busy man. He plans lessons, grades homework, 



helps with after-school activities —and of course, he teaches! His salary isn’t great, but 


that’s OK. His students like his class, so he’s happy. 


Passive Sentences: 5% 

Flesch Reading Ease: 69.6 
Flesch-Kincaid Grade Level: 5.4 
Readability Level: Standard 



